MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
A. 1. LABORATORY 


Artificial Intelligence 

Memo No. Z \9 August 19*71 


INFORMATION THEORY AND THE GAME OF J0TTQ 
Michael &eeler 


Work reported herein was conducted at the Artificial Intelligence Laboratory* 
a Massachusetts Institute of Technology research program supported in part 
by the Advanced Research Projects Agency of the Department of Defense and 
rnorn'tored by the Office of Naval Research under Contract Number NQQQ14- 
70-A-O3&2-0CO2. 


The wpr d gamp; JOTTQ, ha 5 attracted the Interest of several 
computer programmers over the year$ t not to mention countless devoted 
piayers, Rules ares 

1. Each of Z players selects a Splatter English word, not a proper noun, 
as hTs Tr secret word*" 

2. Play consT-sts of alternate turns of naming a “t^st word ,' 1 whose - 
constraints afe the same as on the secret words, and the opponent 
answering how close the test word Ts to his secret word. 

3. Closeness ts measured in jots; each jot Is a one-to-one letter match, 
and Independent of which word Ts the test word. GLASS versus SMILE 
or SISSY Is 2 jots, 

4. The first player to guess his opponent’s secret word wins. 

Constraints on a JQTTQ program arei 

First, It must have a dictionary of ail possible words at the 
outset of each game. CThe modification of addins newly experienced words 
to its dictionary Is trivial In practice and not worth the programming 
effort, especially since one wants to avoid adding word -1 Ike typing 
errors, etc.) The {unacceptable) alternative Is to have a letter¬ 
deducing algorithm and then a “’word proposer" to order the 5 factorial ■ 
l£d combinations {perhaps based on digram frequencies and vowel 
constraints) once all 5 letters ere found. 


Second, the most use the program can make of the jots from a 
given test word is to eliminate from its list of "possible secret words 
of opponent" all those which do not have that number of jots against that 
test word. Hence, each test word should be chosen to maximize the 
expected number of words eliminated, J.e,, maximi re the expected 
Information derived. If test word A divides the list of M possible 
secret ivords Into pO words of no jots, nl of 1 jot, etc, and all words Tn 
the list are equally likely to be the opponent's real secret word, then 
we expect 5 

“ ' TU 

bit 5 of Information 


£ tM-h 


from giving test word A. [See Pierce, "Symbols, Signals, and Noise," 
page 84*) The bes-t test word to give Is the one (out of all those In the 
dictionary) which gives the highest number of bits expectation, 


And third, since the program is to interact with humans, a 
certain amount of human engineering is desirable. For example, many 
humans feel cheated If the computer (or even another human) wins because 
its word is Inordinately obscure; nevertheless, they do not want to be 
limited in their own choice of secret words. Thus CRWTH Ts fair gamE for 
humans, but not for computers! 

A JGTTQ program has existed for a couple of years at MIT's A,I* 
Lab, which program seems to have survived the test of time and thus 


■deserves documentat i on* this program will be the subject of the test of 
Eh f s memo. 

JOTTO Is written in machine language (MIDAS) for the PDP-6/1Q, a 
computer whose Instruction set is very symmetrical, complete, powerful 
and easy to think in. It takes about 121*9 words of core, plus 5 words 

for each of about 7000 dictionary entries* 

Since it employs the 1nformaclengthenry criterion described above 
for te$t word choice. Its sequence of test words ]n guessing a given 
secret word Is fixed. This allows pre-computing, storing and table look¬ 
up of the first few (usually three) test words. The game tree is stored 
to the depth where the list of possible words Is 1(10 or less. Tor 
example, its first test word is always TEA^S, 

There are two second-order effects In choosing test words, 
neither of which Is dealt with Tn any way In the program. The first 

regards permutation groups* flnce all 5 Tetters ere known, alt one can do 

Es sequence through all anagrams (the program does It randomly)* Thus, 

If It takes several test words to narrow the field down to one 
particularly large permutation group, play might he substantially 
Improved by some scheme such as weighting the Information contribution of 
words exponentially as the size of their permutation group* This seems 
unnecessary fn practice, however, since the large permutetIon groups are 
'^ound fairly early In the game tree. 

The other effect |s somewhat subtler, Jt is conceivable that the 
test word with highest expectation at the current point in the game has a 
good chance of getting us to a point where we will NOT have ary 
particularly good test words available. That 3s, 1t may be worth trading 
a fraction of a bit on this turn for a 1 arger (-on the average) fraction 
of a bit on, say, our next turn* ! am Indebted to Bill Gosper for 
pointing out this possibility; the computation, required, however. Is 
Impractical, and besides, the program seems to do acceptably as Is* 

The previously mentioned dilemma regarding C&WTH Is resolved by 
storing with each word one additional bit which says common/uncommon* 

The program never selects an uncommon word for its secret word* Since 
the dictionary Is about 50/50, this amounts to giving the human an 
advantage of about I bit* This is not terribly large compered to the 2+ 
bits per test word the program gets; the program seems to hold its own 
anyway. Also the program never uses an uncommon word as a test word, 
which can also handicap It slightly, but one would not exnect much 
improvement over the best test word among the 3SOD common words. (This 
is borne out by best conror versus best overall test word comparisons 
made In compiling the pre-stored opening test words.) An exception t& 
when aH 5 letters are known; then guessing Is random among all 
permutations which are fn its dictionary, irrespective of the 
co^nnon/uncommon bit. At this point it changes its phraseology from, "MY 
ZEST WORD: TEAMS’ 1 to, " IS YOUR WORD CRWTH?'* 


The dictionary was compiled from Webster's Seventh New- 
Collegiate* A few mistakes have heen found; the program doesn't know the 
word STUCK* These errors have been accumulated, sod will (1 hope) soon 
he corrected* Also aiding the editing of the dictionary are cornier i sons 
with Steve Brown’s jotto dictionary at the MIT RLE PR P-1 (apparently 
mostly compiled from Webster* 5 Mew World Dictionary, College Edition), 
and George A* Miller's National Institutes of Health jot to dictionary 
(complied from Funk and Vtagnall's unahrfdged). 

Dictionary errors are ameliorated by the philosophy behind the 
program: polite l/o hiding powerful computation, it gives a one-line 

explanation when loaded, and at the beginning of each game asks, "WOULD 
YOU LIKE TQ GG FIRST?" AY or !l is sufficient to answer any question It 
asks* When either player wins. It asks whether you want to continue 
guessing Its secret word (or to have It continue guessing yours). it 
does not keen score or gloat over Its wins* If you type a non-word at 
It, it asks, "ARE YOU SURE THAT'S A WORD?" and Y lets you cheat, (Or you 

can Interpret the contents of location SECRET as slxbit and cheat that 

way!) Rubput deletes all of an fncompTetely-typed test word* Typing "?" 
when it's your turn to give a test word makes It ask, "WANT TO GIVE UP?" 

Ope of the nicest features to add would be a back-up* so that 
when It says, "I GIVE UP* DO YOU WANT TO TRY TO GUESS MY WORD?" you can 
say, "Oops ■*- f meant to give you 3 jots on TEARS instead of 2 *.* 
sorry*" Because of the sequential word e)EminatIon* this would 
necessitate a small wait while the updated jot sequence was re-processed. 

Another feature would be reversion to the letter-findIng and 
permutation-guess Ing strategy Instead of just giving up when It finds the 
list of possible secret words has shrunk to aero* This, however. Is not 
worth the pain of programml ng 3t. Yet another "feature" would he to say* 

,f l GIVE UP* WHAT WAS YOUR SECRET WORD? —-*- YOU RAVE ME 2 JUTS ON 

TEARS AND THAT HAS 3 JOTS, GRIPE, GRIPE *,," 

Each time a new game [5 started, ITS, the timesharing system 
under which JOTTO runs. Is called, and the binary values of the current 
date, time of day (both In units of 4.OCR microseconds and Tn slxblt 
HFIMMSS), and run time of this copy of the JOTTO program are XOR-ed 
together* This pseudo-random quantity Is then used to select the 
program's secret word. 

Programmers; please use or Implement a feature to list words 
alphabetically down each column per page Instead of across each row! 

The Improvement in readability Ts at least 1000%, 



